Asymptotically exact heuristics for prime divisors 
of the sequence {a^ -\- b^}^^i 



Pieter Moree 



Abstract 

Let Nafi{x) count the number of primes p < x with p dividing a'^ + 
for some k > 1. It is known that Na^b{x) ~ c{a,b)x/ log x for some ratio- 
nal number c(a, b) that depends in a rather intricate way on a and b. A 
simple heuristic formula for iVa^b(x) is proposed and it is proved that it is 
asymptotically exact, i.e. has the same asymptotic behaviour as Nafi{x). 
Connections with Ramanujan sums and character sums are discussed. 

1 Introduction 

Let p be a prime (indeed, throughout this note the letter p will be used to indicate 
primes). Let g he a non-zero rational number. By i^pig) we denote the exponent 
of p in the canonical factorisation of g. If i^p{g) = 0, then by ordg(p) we denote 
the smallest positive integer k such that g'' = l(mod p). If k = p — 1, then g is 
said to be a primitive root mod p. If is a primitive root mod p, then g^ is a 
primitive root mod p iff gcd(j,p — 1) = L There are thus ^p{p — 1) primitive roots 
mod p in (Z/pZ)*, where ip denotes Euler's totient function. 

Let 7r{x) denote the number of primes p < x and Hgix) the number of primes 
p < X such that g is a primitive root mod p. Artin's celebrated primitive root 
conjecture (1927) states that if g is an integer with > 1 and g is not a square, 
then for some positive rational number Cg we have TTg{x) ~ CgA7T{x), as x tends 
to infinity. Here A denotes Artin's constant 



^ = n(l-^(^) =0-3739558136. 



Hooley [3 , under assumption of the Generalized Riemann Hypothesis (GRH), 
established Artin's conjecture and explicitly evaluated Cg. 

It is an old heuristic idea that the behaviour of TTg{x) should be mimicked by 
Hi{x) = J2p<xV{P ~ " 1)) the idea being that the 'probability' that g is a 
primitive root mod p equals ^p{p— l)/{p— 1) (since this is the density of primitive 
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roots in {Tj/pTj)*). Using the Siegel-Walfisz theorem (see Lemma Q below), it is 
not difficult to show, unconditionally, that Hi{x) ~ Ati^x). Although true for 
many g and also on average, it is however not always true, under GRH, that 
Hg{x) ~ Hi{x), i.e., the heuristic Hi{x) is not always asymptotically exact. Nev- 
ertheless, Moree found a quadratic modification, H2{x), of the above heuristic 
Hi{x) involving the Legendre symbol that is always asymptotically exact (assum- 
ing GRH). 

A prime p is said to divide a sequence S of integers, if it divides at least one 
term of the sequence S (see [T] for a nice introduction to this topic). Several 
authors studied the problem of characterising (prime) divisors of the sequence 

+ Hasse j2I seems to have been the ffist to consider the Dirichlet den- 

sity of prime divisors of such sequences. Later authors, e.g., Odoni [Hj and Wierte- 
lak [TT] strengthened the analytic aspects of his work. The best result to date, 
in the formulation of seems to be as follows (recall that Li(a;) = dt/logt 
is the logarithmic integral): 

Theorem 1 Let a andh he non-zero integers. Putr = a/b. Assume thatr ^ ±1. 
Let A be the largest integer such that \r\ = , with u a rational number. Let 
e = sign(r) and L = Q(a/u)- We have 

,r / N r/ N-r./ N ^ / 3; ( lOg lOg x) ^ A 

V log x / 

where the implied constant may depend on a and b and S{r), a rational number, 
is given in Table 1. 

Table 1: The value of 6{r) 



L 


A 


e = +l 


E = -l 


L ^ QiV2) 


A > 


21-V3 


1 - 2-^/3 


L = Q(V2) 


A = 


17/24 


17/24 


L = Q(v^) 


A = 1 


5/12 


2/3 


L = Q{V2) 


A > 2 


2-^/3 


1 - 2-^-^/3 



Starting point in the proof of Theorem ^ is the observation that p \ 2ab divides 
the sequence {a'' + b''}'^^i iff OTdr{p) is even, where r = a/b. The condition that 
ordr(p) be even is weaker than the condition that ordr(p) = p — 1 and now the 
analytic tools are strong enough to establish an unconditional result. 

Note that S{r) does not depend on e in case A = 0. For a 'generic' choice of a 
and 6, L will be different from Q{\/2) and A will be zero and hence 6{a/b) = 2/3. 
It is not difficult to show jH] that the average density of elements of even order 
in a finite field of prime cardinality also equals 2/3. 

In this note analogs of H^l{x) and H^l{x) of Hi{x) and H2{x) will be in- 
troduced and it will be shown that H^^l{x) is always asymptotically exact. This 
leads to the following main result (where 7r(x; fc, I) denotes the number of primes 
p < X satisfying p = Z(mod k) and {*/p) denotes the Legendre symbol): 
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Theorem 2 Let a and b be non-negative natural numbers. Let Na^i,{x) count the 
number of primes p < x that divide some term a^+b^ in the sequence {a^+b^}'^^^. 
Put r = a/b and e = sgn{a/b). Assume that r 7^ ±1. Let h be the largest integer 
such that \r\ = Tq for some tq G Q and h>l. Put e = i'2{h). If e = 1, then 



p<x, (rQ/p) = l 
"2 (P-I)>e 



log X 



and if e = —1, then 



p<x,(rQ/p) — — 1 P^^) i'^o/p) — ^ ^ 



''2(P-l)=s+l i/2(P-l)>s+l 

where the implied constants depend at most on a and b. 



2 Preliminaries 

The proof of Theorem |2l requires a resuh from analytic number theory: the Siegel- 
Walfisz theorem, see e.g., fOl Satz 4.8.3]. For notational convenience we write 
(a, b) instead of gcd(a, b). 

Lemma 1 Let C > be arbitary. There exists Ci > such that 

7r{x;k,l) = ^ + 0ixe-'^^^), 

uniformly for 1 < k < log'^x, (/, /c) = 1, where the implied constant depends at 
most on C . 

Our two heuristics will be based on the following elementary observation in group 
theory. 

Lemma 2 

1) Let h > 1 and w > be integers. Let G be a cyclic group of order n. Let 
= {g^ -.ge G} and G^ = {g'' : z/2(ord(^'^)) =w}. We have = n/(n, h) 
and #6*0 ~ 2~'^'^^"'^^^''^^^n/(n, h) . Furthermore, for w > 1, we have 



h ^ j 2'"-i-'^2{n/(n,ft))^/(^^^ U2{n/{n, h)) > w; 

otherwise. 



2) If i'2{h) > V2{f^), then every element in G^ has odd order. If V2{h) < i'2{n), 
then G^ C G^^. 

3) We have 

^/GV^'^ ifi^2in) = iy2ih) + l; 



G^'' ^/^/2(n) > z/2(/i) + l. 
If ^2{.n) < i'2{h), then G^ is empty. 



3 



Proof. 1) Let go be a generator of G. On noting that g^^ = g^^ iff mi = 
7722 (mod n), the proof becomes a simple exercise in solving linear congruences. 
In this way one infers that = {g^^ : 1 < k < n/{n, h)} and hence ^^G^ = 
n/ {n, h). Note that ord((7Q'^) is the smallest positive integer m such that n/{n, h) 
divides mk. Thus OTd{gQ^) will be odd iff i'2{k) > h'2{n/{n,h)). Using this 
observation we obtain that 



hk - . , - n , n 



and hence #G'[f = 2-'^2(n/(n,ft))^/(^^ Similarly 

(n, /i) (n, /i) 

and hence we obtain (jT)). 

2) If (^) < ^2{h), then using (j2)) we infer that 



where we have written m = 2k and used that {n, 2h) = 2{n, h). 

3) Similar to that of part 2. □ 

Remark. Note that G^ and Gq with the induced group operation from G are 
actually subgroups of G. 



3 Two heuristic formulae for Na^},{x) 

In this section we propose two heuristics for iVa b(x); one more refined than the 
other. Starting point is the observation that if p f 2ab divides the sequence 
{a^ + ^'^IfcLi if aiid only if ordr.(p) is even, where r = a/b. Let h be the largest 
integer such that we can write |r| = Tq with tq a rational number. Let e = sgn(r). 

We will use Lemma H in the case G = Gp:= (Z/pZ)* = F^. The first heuristic 
approximation we consider is 

p<x, pflab " P 

where K^l (x) is supposed to be an heuristic for the number of primes p < x such 

that ordr(p) is odd. From our results below it will follow that lim^^oo K^l (x) /vr(x) 
exists. Note that in case h = 1, this limit is the average density of elements of 
odd order (if e = 1), respectively of order congruent to 2(mod 4) (if e = — 1). 
For a more detailed investigation of the average number of elements having order 
= a(mod d) vide [H|. 

Suppose that p | 2ab. By assumption r G eGp. In the case e = 1, the latter set 
has T^Gp o elements having odd order and so, in some sense, ^Gp^^/^Gp is the 

probability that ordr(p) is odd. This motivates the definition of K^l{x) in case 
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e = 1. In case e = — 1 we use the observation that for p is odd, —Tq has odd order 
iff Tq has order congruent to 2(mod 4). Thus the elements in —G^ of odd order 
are precisely the elements having order 2(mod 4) in and hence have cardinal- 
ity On using part 1 of LemmaElwe infer that K^^li^) = Sp<x, p\2ab ^i!b(p) 
with 

/ (l + e)/2 if z/2(p-l) <e; 
l^aAP) - \ 2^--2(P-i) if z/2(p - 1) > e. 

An heuristic H^^l{x) for Na^x) is now obtained on merely setting H^l{x) = 
7r(x) — K^l{x). Put uj{n) = J2p\n ^- using Q we then infer that 

H^^l{x) = tt{x; 2'+\ 1) - 2^ V 2-"^'^^-^^ + 0{uj{ab)). 



if e = 1 and 



iV2(p-l)>e 

if e = -1. 

In the context of (near) primitive roots it is known that the analoga of H^l (x) 
do not always, assuming GRH, exhibit the correct asymptotic behaviour, but 
that an appropriate 'quadratic' heuristic, i.e. an heuristic taking into account 
Legendre symbols, always has the correct asymptotic behaviour jSl El El (iu |Z1 
the main result of [E] is proved in a different and much shorter way). With this 

(2) 

in mind, we propose a second, more refined, heuristic: H^l{x). 

If t'p(r) = we can consider \r\ = and tq as elements of Gp. We write 
{tq/p) = 1 if tq is a square in Gp and (tq/p) = —1 otherwise. 

First consider the case where e := sgn(r) = 1. If z/2(p — 1) < e := z/2(/i), then 
r has odd order by part 2 of Lemma |21 If i^2{p — 1) > ^2{h) and {j-q/p) = — 1, 
then r G Gp, but r ^ Gp^ (by part 2 of Lemma |21 again) . It then follows that r 
has even order. On the other hand, if {ro/p) = 1 then r G G^^. This suggests to 
take 



K»= E 1+ E lif. 



p<x, U2(p—l)<e P<^, ('■o/p)=i P 

■'2(P-l)>s 



where furthermore we require that p \ 2ab. A similar argument, now using part 
3 instead of part 2 of Lemma leads to the choice 



a,by-^) 4LQ2h ^ 4^G'^^'' 

p<x, (rQ/p) = -l P P<x, (ro/p) = l jP 

>'2(P-1) = '! + 1 I/2(P-1)>': + 1 



in case e = —1, where again we furthermore require that p \ 2ab. We obtain 

<i(^) = E,<.,pt^a.<i(p),with 

(l + e)/2 ifi/2(p-l)<e; 
fcf )(p) = <! (1 + e(^))/2 if u,{p - 1) = e + 1; (4) 

:i + (m)2^~-^(P-^) if z/2(p - 1) > e + 1. 
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Now we put Hj^l(x) = 7r{x) — Kj^l{x) as before. On invoking Lemma El H^l{x) 
can then be more explicitly written as 

hI^I{x) = 7r{x; 2^+1, 1) - 2^+^ ^^^'^"'^^ + 0{uo{ah)), (5) 



p<x, (ro/p) = l 
"2 (P-I)>e 



if e = 1 and 



H^S^x)=^{x)- J2 1-2'^^' E 2--(^-^) + 0(^(a6)), (6) 



p<a;, (rg/p) = -l P<x, {rQ/p) = l 

■'2(P-l) = e+l i/2(P-l)>s+l 



if e = -1. 



4 Asymptotic analysis of the heuristic formulae 

In this section we determine the asymptotic behaviour of H^l {x) and H^l (x). We 
adopt the notation from Theorem |21 and in addition write D for the discriminant 
of QiVr^)- Note that D > 0. 

Theorem 3 Let A > be arbitrary. The implied constants below depend at most 
on A. 

1) We have 



H^J^lix) = 6i{r)U{x) + O(xlog-^x) + 0{u}{ab)), 
SAr) - 



where 



21-/3 ^fe = +l; 
1-2-73 z/e = -l. 

In particular, if L ^ Q{y/2), then H^l{x) is an asymptotically exact heuristic for 

Na,bix). 

2) We have 

H^^lix) = 5(r)Li(a;) + 0(D=^x log"^ x) + 0{uj{ab)). 

In particular, H^l{x) is an asymptotically exact heuristic for Na^h{x). 

The proof of part 2 requires a few facts from algebraic number theory, the proof 
of part 1 does not even require that and is an easier variant of the proof of part 2 
(and is left to the interested reader). The proof of part 2 rests on a few lemmas. 

Lemma 3 Let n be a non-zero integer and K = Q(v^) quadratic number field 
of discriminant A. Let A > 1 and C > be positive real numbers. Then 

E ^=Li(x)( I - ^ ]+^ '^'"^ 



p<a^, (ri/p) — 1 



uniformly in k with k satisfying 2'^"^^|A| < log'^x, where the implied constant 
depends at most on A and C . 
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Proof. By quadratic reciprocity a prime p satisfies (n/p) = 1 iff p is in a certain 
set of congruences classes modulo 4|A|. Thus the primes we are counting in our 
sum are precisely the primes that belong to certain congruences classes modulo 
2^+^|A|, but do not belong to certain congruence classes of modulus 2'^"'"^|A|. The 
total number of congruence classes involved is less than 8|A|. Now apply Lemma 
^ This yields the result but with an, as yet, unknown density. 

On the other hand, the primes p that are counted are precisely the primes 
p < X that split completely in the normal number field ^^'(^2'=), but do not split 
completely in the normal number field K{(2''+^)- If M is any normal extension 
then it is a consequence of Chebotarev's density theorem that the set of primes 
that split completely in M has density 1/[M : Q]. On using this, the proof is 
completed. □ 

Lemma 4 Let m be fixed. With the notation as in the previous lemma we have 



P<x, {n/p)—l k=m ^ 

where the implied constant depends at most on A. 
Proof. We have 



1 1 \ / A^x 



[ir(C2.):Q] [ir(C2^+i) :Q]; Vlog^x 



mi 



2-^^(^^-i) = 5^ 2-'= + 0( 

p<x, (n/p) — 1 k=m P<x, (n/p) — 1 



X 

4™i 



where we used the trivial bound J2p<x U2ip-i)>mi ^ '^^^^ ~ 0(x/4™^). Choose 
mi to be the largest integer such that 2™'i+^|A| < log*" a;. Apply Lemma El with 
any C > A/2. It follows that 



X 



"2 (p-l)>ni 

oo 



Lifx) 



k=m 



^ 2^ ([i^(C20 : Q] [^(C2^+0 : Q]) ^ ^^4^1 



X 



where we used that ^{2^) < [K{(2f=) ■ Q] < 2^5(2'=). On noting that 0(x/4™i) = 
0(A^a; log""^ x), the result follows. □ 

Lemma 5 We have H^^l{x) = 62{r)Li{x) + 0{D^xlog~^ x) + 0{uj{ab)), where 



if e = 1 and 

S2{r) = 1 



2fc+i, 



(7) 



2-+^ [L(C2e+0:Q] [L{C2^+^: 

1 



|i(C2> 



+ 1 , 
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Proof. This easily follows on combining the previous lemma with equation (0), 
respectively Q 

Remark. From ((7j) and (jH)) we infer that 
S2{-\r\) - 6,{\r\) = 1 - ^ + 



The number S2{r) can be readily evaluated on using the following simple fact 
from algebraic number theory: 

Lemma 6 Let K be a real quadratic field. Let k > 1. Then 



\T^(/- \ 01-/2^ if k <2 or K ^ z.), 
[K[Q,.) : - I 2.-1 ^fk>3andK = Q{V2). 



Proof. If is a quadratic field other than Q(v2) then there is an odd prime 
that ramifies in it. This prime, however, does not ramify in Q(C2"), so in this 
case K and Q(a/2) are hnearly disjoint. Note that Cs + Cs^^ = and hence 
Q(v^) C Q(C8)- Using the well-known result that [Q(C„) : Q] = ^(n), the result 
is then easily completed. □ 

The result of this evaluation is stated below. 
Lemma 7 We have S2{r) = S{r). 

After all this preliminary work, it is straightforward to prove the two main results 
of this note: 

Proof of Theorem^ 1) Left to the reader. 2) Combine the latter lemma with 

T^j m ('2') 

LemmaEl Comparison with Theorem ^ shows that W^l^x) ~ Na^b{,x) as x — > cxo 
and thus H^l{x) is an asymptotically exact approximation of Na^b{x). □ 

Proof of Theorem^ Combine part 2 of Theorem El (with any A > 3), Theorem 
[Hand equations © and (jH))- Q 

5 Two alternative formulations 

5.1 An alternative formulation using Ramanujan sums 

Recall that the Ramanujan sum Cn{m) is defined as ^i<fc<„ (j!, e^'"*'^'"/"'. It 
is well-known that Cn{m) G Z and, more in particular, that 

/i(n/ (n, m)) 



c„(m) = ip{n) 



ip{n/ (n, m)) 



This is known as Holder's identity. It implies that c„(m) = c„((n, m)). For our 
purposes the following weak version of Holder's identity will suffice: 

if z/2(t) < v; 

C2.(t) = { -^{2^) iiu2{t)=v-l; (9) 
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Another elementary property of Ramanujan sums we need is that for arbitrary 
natural numbers n and m 

— 'S^Cd{m) — \^ (10) 
n ^ I otherwise. 

d\n 

Suppose that t'p(r) = 0, then ordr.(p)[F* : (r)] = p — 1. Note that ordr(p) is off 
iff 2''2(P-i)|[F* : (r)]. Using identity (HU)) it then follows that 

N^,{x)=^{x)- Yl 2-^^'^""'^ E C2.([F;:(r)]) + OMa6)). (11) 

p<x, p\2ab v<U2{p—l) 

Corollary below shows that if in the latter double sum the summation is re- 
stricted to those V satisfying in addition v < e, respectively v < e + 1, then 
Kj^l{x), respectively Kj^l{x) is obtained. This in combination with Theorems ^ 
and El leads to the following theorem: 

Theorem 4 We have, in the notation of Theorem^ Na^b{x) = 

p<x, p\2ab 2"|(p-l,2fc) \ & / 

and 

p<x, p^ab e+2<.v<.V2{p^l) 

where the implied constant depends at most on a and b. 
Remark. Note that v < min(z/2(p — 1), e + 1) is equivalent with 2^\{p — 1, 2h). 

Lemma 8 Let a, b, e and e be as in Theorem^ and let p \ 2ab. 

1) We have 

2--(^-^) E c,.i[¥;:{r)]) = k^^l{p). 

2) We have 

t)<min(i'2(p— l),e+l) 

Corollary 1 For 1 < j < 2 we have 

J2 2--(^-i) E cArp--{r)]) = Ki^}{x). 

p<x, p\2ab D<min(i/2(p~l),e+j — 1) 

Proof of Lemma IHl 1) Let us consider the case e = — 1 and z/2(p — 1) > e (the 
remaining cases are similar and left to the reader). Since (— 1)(p~i)/2'' = l(mod p) 
we see that —1 and hence r is a 2'^th-power mod p and thus i^2([F* : (r)]) > e. 
Hence the sum in the statement of the lemma reduces to 2~'^^^p~^^ Ylv<e V^(2*') ~ 
2e-u2ip-i) ^ k^^J^^p)^ where ©, © and the identity J2d\nfid) = n are used. 
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x(log logx) 
log^ X 



2) The case z^2(p — 1) < e. The quantity under consideration agrees with that of 

part 1 and by (jH) we obtain that k^^lip) = (1 + e)/2 = k^a},ip)- 
The case z/2(p — 1) = e + 1. Now 

(— 1) 2=+! = l(mod p), (rg) 2=+i = ( — )(mod p) and hence r 2=+i = e( — )(mod p). 

p p 

It follows that z/2([F; : (r)) > e + 1 if e(^) = 1 andz/2([F; : (r)]) = eif e(^) = -1. 
Using the quantity under consideration is seen to reduce to 



,v<e ^ 



By (jH) this equals k^l{p)- 

The case i'2{p — 1) > e + 1. Now r^^^-*^^/^^^" = (y)mod p). Proceeding as before 
the quantity under consideration reduces to 



e-j/2(p-l)Q _|_ 

P 



5.2 An alternative formulation involving character sums 

Let G be a cyclic group of order n and g E G. It is not difficult to show 
that, for any d\n, Y.ord{x)=dXi9) = Cdi[G : {g)]). Using this and noting that 
x(^) = x(^)x'^(^o)) equation (fTT|) can be rewritten as 

iV„,,(x) = 7r(x) - Yl 2"''^'"'^ E X{e)x\ro) + Oiujiab)), (12) 

p<a;, pt2afe ord(x)|2''2(P-i) 

where the sum is over all characters of F* having order dividing 2'^^^^~^\ Note 
that if X is of order 2^, then x'^ is the trivial character if w < e and a quadratic 
character if f = e + 1. If in the main term of ()12|1 only those characters of order 
dividing h are retained, i.e. those for which x^ is the trivial character, then 
H^l{x) is obtained (this is a reformulation of part 1 of Lemma |H)) and hence, 
by part 1 of Theorem El the naive heuristic. If in (fT^ only those characters of 
order dividing 2h are retained, i.e. those for which x^ is the trivial or a quadratic 
character, then the asymptotically exact heuristic is obtained. The error term 
assertion in Theorem |3] can be reformulated as: 

Proposition 1 We have 



p<x, p\2ah 2e+2|ord(x)|2'"2(P-i) \ 6 / 



where the implied constant depends at most on a and b. 

In the setting of near primitive roots it is already known that for the main term 
of the counting function of (near) primitive roots only the contributions coming 
from characters that are either trivial or quadratic need to be included jO]. 
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6 Conclusion 



There is a naive heuristic for Na^i,{x) that in many, but not all, cases is asymp- 
totically exact. There is a quadratic modification of this heuristic involving the 
Legcndre symboU that is always asymptotically exact. The same phenomenon is 
observed (assuming GRH) in the setting of Artin's primitive root conjecture. 
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